
The Generalized Asymptotic Equipartition Property: Necessary and Sufficient Conditions

Author: Harrison Matthew T.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/11/2007

Suppose a string $X_1^n=(X_1,X_2,\ldots,X_n)$ generated by a memoryless source $(X_n)_{n\geq 1}$ with distribution $P$ is to be compressed with distortion no greater than $D\geq 0$, using a memoryless random codebook with distribution $Q$. The compression performance is determined by the "generalized asymptotic equipartition property" (AEP), which states that the probability of finding a $D$-close match between $X_1^n$ and any given codeword $Y_1^n$ is approximately $2^{-n R(P,Q,D)}$, where the rate function $R(P,Q,D)$ can be expressed as an infimum of relative entropies. The main purpose here is to remove various restrictive assumptions on the validity of this result that have appeared in the recent literature. Necessary and sufficient conditions for the generalized AEP are provided in the general setting of abstract alphabets and unbounded distortion measures. All possible distortion levels $D\geq 0$ are considered; the source $(X_n)_{n\geq 1}$ can be stationary and ergodic; and the codebook distribution can have memory. Moreover, the behavior of the matching probability is precisely characterized, even when the generalized AEP is not valid. Natural characterizations of the rate function $R(P,Q,D)$ are established under equally general conditions. Comment: 19 page
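The statement in this abstract can be written out as a short worked formula. This is only a sketch in the memoryless setting: the single-letter distortion measure $\rho$, its per-symbol average $\rho_n$, and the joint distribution $W$ with $X$-marginal $W_X$ are notation assumed here for illustration and do not appear in the abstract; the precise mode of convergence and the conditions under which it holds are the subject of the paper itself.

```latex
% Assumed notation (not from the abstract): \rho, \rho_n, W, W_X.
% Per-symbol average distortion between a source string and a codeword:
\[
  \rho_n(x_1^n, y_1^n) = \frac{1}{n} \sum_{i=1}^{n} \rho(x_i, y_i)
\]
% Generalized AEP: the probability that a random codeword is D-close to X_1^n
% decays exponentially at rate R(P,Q,D):
\[
  -\frac{1}{n} \log_2 Q^n\bigl\{\, y_1^n : \rho_n(X_1^n, y_1^n) \le D \,\bigr\}
  \;\longrightarrow\; R(P,Q,D) \qquad (n \to \infty)
\]
% Rate function as an infimum of relative entropies over joint distributions W
% whose X-marginal is P and whose expected distortion is at most D:
\[
  R(P,Q,D) = \inf_{W \,:\, W_X = P,\; E_W[\rho(X,Y)] \le D} H(W \,\|\, P \times Q)
\]
```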

arXiv.org e-Print Archive

Crossref

Conservative Hypothesis Tests and Confidence Intervals using Importance Sampling

Author: Harrison Matthew T.
Publication date: 08/04/2011

Importance sampling is a common technique for Monte Carlo approximation, including Monte Carlo approximation of p-values. Here it is shown that a simple correction of the usual importance sampling p-values creates valid p-values, meaning that a hypothesis test created by rejecting the null when the p-value is <= alpha will also have a type I error rate <= alpha. This correction uses the importance weight of the original observation, which gives valuable diagnostic information under the null hypothesis. Using the corrected p-values can be crucial for multiple testing and also in problems where evaluating the accuracy of importance sampling approximations is difficult. Inverting the corrected p-values provides a useful way to create Monte Carlo confidence intervals that maintain the nominal significance level and use only a single Monte Carlo sample. Several applications are described, including accelerated multiple testing for a large neurophysiological dataset and exact conditional inference for a logistic regression model with nuisance parameters. Comment: 26 pages, 3 figures, 3 tables [significant rewrite of version 1, including additional examples, title change
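A minimal Python sketch of the idea follows. The abstract only says that the correction uses the importance weight of the original observation; the specific form used here (add that weight to the numerator and one to the sample count) is an assumption made for illustration, as are the function names and the convention that each weight is the ratio of null density to proposal density.

```python
def usual_is_pvalue(t_obs, t_samples, w_samples):
    """Plain importance sampling estimate of P(T >= t_obs) under the null."""
    n = len(t_samples)
    exceed = sum(w for t, w in zip(t_samples, w_samples) if t >= t_obs)
    return min(1.0, exceed / n)


def corrected_is_pvalue(t_obs, w_obs, t_samples, w_samples):
    """Assumed form of the correction: treat the observation as one extra
    draw by adding its importance weight w_obs to the numerator and 1 to
    the sample count. Hypothetical helper, not quoted from the paper."""
    n = len(t_samples)
    exceed = sum(w for t, w in zip(t_samples, w_samples) if t >= t_obs)
    return min(1.0, (w_obs + exceed) / (n + 1))
```

With all weights equal to 1 (sampling directly from the null), the corrected value reduces to the familiar Monte Carlo p-value (1 + number of exceedances) / (n + 1).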

arXiv.org e-Print Archive

CiteSeerX

Inconsistency of Pitman-Yor process mixtures for the number of components

Author: Jeffrey W. Miller
Matthew T. Harrison
Whye Teh
Publication date: 30/08/2013

In many applications, a finite mixture is a natural model, but it can be difficult to choose an appropriate number of components. To circumvent this choice, investigators are increasingly turning to Dirichlet process mixtures (DPMs) and, more generally, Pitman-Yor process mixtures (PYMs). While these models may be well-suited for Bayesian density estimation, many investigators are using them for inferences about the number of components, by considering the posterior on the number of components represented in the observed data. We show that this posterior is not consistent; that is, on data from a finite mixture, it does not concentrate at the true number of components. This result applies to a large class of nonparametric mixtures, including DPMs and PYMs, over a wide variety of families of component distributions, including essentially all discrete families, as well as continuous exponential families satisfying mild regularity conditions (such as multivariate Gaussians). Comment: This is a general treatment of the problem discussed in our related article, "A simple example of Dirichlet process mixture inconsistency for the number of components", Miller and Harrison (2013) arXiv:1301.270

arXiv.org e-Print Archive

CiteSeerX

Exact Enumeration and Sampling of Matrices with Specified Margins

Author: Harrison Matthew T.
Miller Jeffrey W.
Publication date: 02/04/2011

We describe a dynamic programming algorithm for exact counting and exact uniform sampling of matrices with specified row and column sums. The algorithm runs in polynomial time when the column sums are bounded. Binary or non-negative integer matrices are handled. The method is distinguished by applicability to non-regular margins, tractability on large matrices, and the capacity for exact sampling.
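To make the counting problem concrete, here is a small Python sketch that counts binary matrices with given margins by memoized recursion over rows. It is not the algorithm of the paper and makes no claim to its polynomial running time: it enumerates column subsets row by row and is only practical for small inputs. The function name `count_binary_matrices` is hypothetical. The memoization relies on the fact that the number of completions depends only on the multiset of remaining column sums.

```python
from functools import lru_cache
from itertools import combinations


def count_binary_matrices(row_sums, col_sums):
    """Count 0/1 matrices whose row and column sums match the given margins."""
    if sum(row_sums) != sum(col_sums):
        return 0
    rows = tuple(row_sums)

    @lru_cache(maxsize=None)
    def count(i, cols):
        # cols holds the sorted remaining column sums still to be filled.
        if i == len(rows):
            return 1 if all(c == 0 for c in cols) else 0
        open_cols = [j for j, c in enumerate(cols) if c > 0]
        total = 0
        # Choose which columns receive a 1 in row i.
        for chosen in combinations(open_cols, rows[i]):
            remaining = list(cols)
            for j in chosen:
                remaining[j] -= 1
            # Sorting canonicalizes the state so equivalent states share a cache entry.
            total += count(i + 1, tuple(sorted(remaining)))
        return total

    return count(0, tuple(sorted(col_sums)))
```

For example, with all margins equal to 1 on a 3 x 3 matrix, the function counts the six permutation matrices.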

arXiv.org e-Print Archive

CiteSeerX

Lattice QCD calculation of the ${{B}_{(s)}\to D_{(s)}^{*}\ell{\nu}}$ form factors at zero recoil and implications for ${|V_{cb}|}$

Author: Davies Christine
Harrison Judd
Wingate Matthew
Publication venue: 'American Physical Society (APS)'
Publication date: 01/03/2018

We present results of a lattice QCD calculation of $B\to D^*$ and $B_s\to D_s^*$ axial vector matrix elements with both states at rest. These zero recoil matrix elements provide the normalization necessary to infer a value for the CKM matrix element $|V_{cb}|$ from experimental measurements of $\bar{B}^0\to D^{*+}\ell^-\bar{\nu}$ and $\bar{B}^0_s\to D_s^{*+}\ell^-\bar{\nu}$ decay. Results are derived from correlation functions computed with highly improved staggered quarks (HISQ) for light, strange, and charm quark propagators, and nonrelativistic QCD for the bottom quark propagator. The calculation of correlation functions employs MILC Collaboration ensembles over a range of three lattice spacings. These gauge field configurations include sea quark effects of charm, strange, and equal-mass up and down quarks. We use ensembles with physically light up and down quarks, as well as heavier values. Our main results are $\mathcal{F}^{B\to D^*}(1)= 0.895\pm 0.010_{\mathrm{stat}}\pm 0.024_{\mathrm{sys}}$ and $\mathcal{F}^{B_s\to D_s^*}(1)= 0.883\pm 0.010_{\mathrm{stat}}\pm 0.028_{\mathrm{sys}}$. We discuss the consequences for $|V_{cb}|$ in light of recent investigations into the extrapolation of experimental data to zero recoil. Comment: 23 pages. v3: Typos corrected. v2: Improved treatment of finite volume effects. Small change to some results (but smaller than the quoted uncertainties). Version accepted for publication in Phys. Rev.
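As a reading aid for the quoted results, the short Python sketch below combines the statistical and systematic uncertainties into a single total. Adding them in quadrature assumes the two sources are independent; that is a common convention assumed here, not something the abstract states, and the helper name `combine_in_quadrature` is hypothetical.

```python
import math


def combine_in_quadrature(stat, sys):
    """Total uncertainty under the assumption that stat and sys are independent."""
    return math.hypot(stat, sys)


# Uncertainties quoted in the abstract for the two zero-recoil form factors.
total_b = combine_in_quadrature(0.010, 0.024)   # for F^{B -> D*}(1) = 0.895
total_bs = combine_in_quadrature(0.010, 0.028)  # for F^{B_s -> D_s*}(1) = 0.883
print(f"F(B->D*)(1)     = 0.895 +/- {total_b:.3f}")
print(f"F(B_s->D_s*)(1) = 0.883 +/- {total_bs:.3f}")
```

Under this assumption the total uncertainties are about 0.026 and 0.030, so both results are dominated by the systematic component.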

arXiv.org e-Print Archive

Enlighten